🐿️ Scour
🧠 LLM Inference

Quantization, Attention Mechanisms, Batch Processing, KV Caching

Understanding LLMs: Insights from Mechanistic Interpretability
lesswrong.com·12h
🏆LLM Benchmarking
From Multi-Head to Latent Attention: The Evolution of Attention Mechanisms
vinithavn.medium.com·23h·
Discuss: Hacker News
🎯Vector Quantization
LongCat-Flash, a language model with 560B total parameters, MoE architecture
github.com·3h·
Discuss: Hacker News
💾Prompt Caching
VGG v GoogleNet: Just how deep can they go?
mayberay.bearblog.dev·22h
📊Embeddings
🌟Introducing Art-0-8B: Reasoning the way you want it to with Adaptive Thinking🌟
huggingface.co·19h·
Discuss: r/LocalLLaMA
🆕New AI
BRILLIANT @GoogleDeepMind research.
threadreaderapp.com·1h
📊Embeddings
The Art of Transformer Programming (2023)
yanivle.github.io·6h·
Discuss: Hacker News
💻Programming languages
5 Prompting Techniques That Changed My Life as an AI Engineer (and Everyday AI User)
pub.towardsai.net·11h
🪄Prompt Engineering
Guide to Contrastive Learning: Techniques, Models, and Applications
medium.com·15h·
Discuss: Hacker News
📊Embeddings
Knowledge and data-driven two-layer networking for accurate metabolite annotation in untargeted metabolomics
nature.com·19h
🍄Mycorrhizal Networks
Creating the brain behind dumb models
reddit.com·5h·
Discuss: r/LocalLLaMA
🚀LanceDB
Why AI Alone Fails at Large-Scale Code Modernization
thenewstack.io·13h
👨‍💻Software development practices
The joint estimation of uncertainty and its relationship with psychotic-like traits and psychometric schizotypy
nature.com·3h
🔍AI Interpretability
Artificial neuron merges DRAM with MoS₂ circuits to better emulate brain-like adaptability
techxplore.com·17h
⚡Hardware Acceleration
Applied AI Fundamentals: Structured Outputs
ouachitalabs.com·14h·
Discuss: Hacker News
🪄Prompt Engineering
Huawei develops AI inference solution to reduce reliance on foreign HBM chips
basanzietech.blogspot.com·15h·
Discuss: Hacker News
📊Model Serving Economics
Ensemble Learning
en.wikipedia.org·13h·
Discuss: Hacker News
📊Statistical Ranking
Neglecton Particles Could Be Key to More Stable Quantum Computers
scientificamerican.com·18h
🌳Data Structures
Despite the hype, generative AI hasn’t outshined humans in creative idea generation
nordot.app·11h
🎭Claude
Abstract Machine Models Also: what Rust got particularly right
dr-knz.net·2h·
Discuss: Hacker News
💻Programming languages